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1, INTRODUCTION 

Hour-ahead electricity price prediction is crucial to market participants in deregulated electricity 
market to produce an appropriate bidding plan where the quantity of bids can be revised or changed prior to 
the dispatch hour. However, only a few studies have been conducted in the field of hour-ahead price 
forecasting. This is because most power markets run two-settlement market structure (day-ahead and real 
time) or standard market design rather than single-settlement system (real time). 

Previous researcers proposed various methods such as time series model of Multivariate Adaptive 
Regression Splines (MARS) [1], Levenberg-marquardt (LM) back propagation [2], and Input-Output Hidden 
Markov Model (IOHMM) [3]. Meanwhile, a hybrid method are also developed such as recurrent neural 
networks (RNN) and excitable dynamics [4] and a hybrid model of Autoregressive Moving Average 
Exogenous (ARMAX), adaptive wavelet neural network (AWNN), and Generalized Autoregressive 
Conditional Heteroskedasticity (GARCH) to treat linear and nonlinear structures of price series [5]. Other 


Journal homepage: http://iaescore.com/journals/index.php/ijeecs 


Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752 O 749 


Neural Network (NN) techniques were also modelled such as Expectation Maximization (EM) technique for 
maximum likelihood estimation of RNN (RNN-EM) [6], Multi-layer Perceptron NN trained by Extended 
Kalman Filter (MLP-EKF) and EM (MLP-EM) [7] and Extended Kalman Filter for RNN (RNN-EKF) [8]. A 
Generalized Regression Neural Network (GRNN) was developed by [9] and Discrete Cosine Transforms 
Input Featured Feed-Forward Neural Network (DCT-FFNN) model was proposed by [10]. The same 
researchers further improved the forecast by creating classification models using three layered FFNN, 
Cascade-Forward Neural Network (CFNN) trained by the LM algorithm, and GRNN models [11]. 

Most existing techniques have good predictions during normal circumstances or without a surge 
event; but when spikes are present, forecast predictions become large. Hence, this study introduces a new 
technique in electricity price forecast by developing hour-ahead electricity price forecasting model with Least 
Square Support Vector Machine (LSSVM) and Bacterial Foraging Optimization Algorithm (BFOA). BFOA 
has a fast convergence [12] and has been explored in many fields such as face recognition [13], [14], 
biometric authentication [15], multimodal function [16], [17], and flexible manufacturing systems (FMS) 
[18]. Furthermore, researchers in control and power system developed BFOA models for Static Synchronous 
Series Compensator (SSSC) Damping Controller Design [19], robotic manipulator workspace optimization 
[20], three phase induction motor and electricity load forecasting [28], [34]. To the best of the authors’ 
review, no literature has been found on the combination of LSSVM and BFOA in the electricity price 
forecast. Furthermore, the approach of multistage feature and parameter selections using a single 
optimization method has not been investigated yet. With a single optimization method of BFOA, the input 
features and LSSVM parameters are simultaneously optimized through five-stage optimization approach. 
This method is shown to provide better prediction accuracy compared to most existing models, which can 
contribute for decision-making and hourly market operation. 


2. TOPOLOGY OF SVM, LSSVM AND BFOA 
This section provides topologies of SVM, LSSVM and BFOA which were applied in this study. 


2.1.SVM and LSSVM 

SVM can reduce over-fitting, local minima problems [26], and able to deal with high dimensional 
input spaces splendidly. However, the main drawback of SVM is the high computational complexity due to 
constrained optimization programming. Hence, Least Squares Support Vector Machine (LSSVM) has been 
proposed to reduce the SVM computational burden, which applies with equality rather than the inequality 
constraints. LSSVM solves a system of linear equations to cater Quadratic Programming (QP) issues that 
increase computational speed [27], [28]. The linear system, namely as Karush- Kuhn-Tucker (KKT), is 
simpler than QP system. LSSVM also keeps the SVM principle, which has good generalization capability. 
LSSVM reduces the Sum Square Errors (SSEs) of training data sets and concurrently diminishing margin 
error. The LSSVM model for regression is represented as in (1): 


N 
f=) a, K(x,%,) +d (1) 
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2.2. BFOA 

The E.coli bacteria, which is present in human's intestines has unique foraging activities during 
locating and ingesting nutrient or food. BFOA imitates this mechanism through four main steps during 
foraging; namely, chemotaxis, swarming, reproduction, and elimination-dispersal. The flow of BFOA applied 
in this work is shown in Figure 1. 

In the chemotaxis step, bacteria look for nutrients to maximize the energy intake while foraging by 
taking small steps (chemotaxis) and interacting with other bacteria by sending attractant signal to form 
flocks; or repellent signal to move individually. They tumble or swim to search nutrient but keep away from 


Fakes 
unsafe places. Therefore, suppose that OG KD is the i-th bacterium position at j-th chemotactic, k-th 
reproduction, and /-th elimination-dispersal step, the position of each bacterium after swimming or tumbling 
can be defined as (2): 
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Figure 1: Flowchart of LSSVM-BFOA Model Development 


Where C(i) is the measure of the step taken during tumbling or swimming, and A is the vector in a 
random direction where the elements lie in position [-1,1]. The objective function or actual cost for every 
location of bacterium 7 is calculated and represented as J(i,j,k,/). During swarming step, a bacterium that has 
found a good nutrient source during its search may attract other bacteria to form flocks. Instead, the repellent 
signal may be released to ensure that the bacteria are not too close to each other. The cell-to-cell attraction 
and repellent of E.Coli swarm can be represented as Joc, the objective function value to be added to the 
current objective function which will decrease the final objective function. When food is sufficient and the 
temperature is appropriate, the healthiest or good bacteria will grow in length and break in the middle to form 
the self-replicating which contributes to the next generation while the least healthy bacteria die. This activity 
is known as reproduction. Thus, BFOA uses this phenomenon by structuring the best objective function in 
the ascending order and maintaining half of the population size to reproduce while the other half is 
eliminated. The last step is the elimination-dispersal where the chemotactic process can be dissolved and the 
bacteria spread to new positions when a sudden change in the environment exists. 


3. RESEARCH METHOD 

In Ontario, the electricity market is operated by the Independent Electricity Systems Operator 
(IESO), which controls the operation of power systems, predicts short-term demand and electricity supply, 
and manages real time market electricity prices. Due to the single settlement real-time power market, Ontario 
is reported to be one of the most volatile markets in the world and hence it is a big challenge for electric price 
forecasters. [29]. Appropriate selection of features affects the efficiency and accuracy of predictions. The 
input features used in this study are as in [30], where correlation analysis is performed to observe the 
significant features for forecasting. The total features are [(15 days x 24 hours price) + (15 days x 24 hours 
demand) + 1-hour pre-dispatch price = 721]. Noted that this method 1s an initial process to filter or reduce the 
number of features to be optimized by BFOA. Hybrid model of LSSVM-BFOA was developed with five- 
stage optimization of feature and parameter. The flow of BFOA applied in this work is shown in Figure 1. 

During the first stage, all 721 features are applied and the BFOA selects certain number of 
significant features to be fed into the LSSVM. At the same time, BFOA optimizes the LSSVM parameters; 
gamma (y) and sigma (o). During the second stage of optimization, BFOA optimizes the features and 
parameters that have been selected from the first stage of optimization. These steps are repeated for the next 
stage of optimization until no improvement has been observed in the fitness value or Mean Absolute 
Percentage Error (MAPE). MAPE and Mean Absolute Error (MAE) are expressed as in (3) and (4); 
respectively, where Pactual and Prorecast are the actual and forecasted HOEP at hour ¢, respectively, while N is 
the number of hours. 
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4. RESULTS AND ANALYSIS 

In comparison with previous researchers, six predictive models were developed to represent 
throughout 2004. Each model is trained with ten weeks of training samples prior to the forecasting week as 
presented in [30]. Table 1 presents the result for all test weeks and optimization stages. It can be noted that 
the average MAPE decreases after each level of optimization. The best MAPEs are obtained during the fifth 
stage of optimization. Table 2 reveals the network configurations for all test weeks during the fifth stage of 
optimization. The BFOA parameters must be chosen properly by trial and error method [31], [32], [33]. 

The main optimization process occurs during chemotaxis activity where the objective function is 
calculated for each bacterium. Too small value of N. may trap the bacteria into local minima. N, value must 
be smaller than N, value. Although the swimming activity occurs in chemotaxis loop, the swimming counter 
will be terminated if the MAPE produced is greater than the previous MAPE. The value of Pea 1s set as 0.25 
since too large value can increase computational burden due to an extensive search. Meanwhile, the Nre 
should not be too small as it may cause premature convergence. As in general, increasing the size of S, Nea, 
Nye, and N; may increase the computational burden, but hopefully it may improve the optimization process 
since bacteria have a wider search space. Furthermore, the developed models of LSSVM-BFOA are 
compared with other existing models as tabulated in Table 3. Based on the observation of Table 3, the 
LSSVM-BFOA model shows better result than other available models except for LSSVM+GA [30] and 
RNN with excitable dynamics [4] models. However, the LSSVM-BFOA model is considered as comparable 
since the differences of MAPE are only 0.55% [4] and 3.6% [30]. In comparison, the RNN model has a more 
complex structure, which is designed to handle spiky and non-spiky price regions. 


Table 1. MAPE of LSSVM-BFOA for Hour-Ahead Forecast 


Optimization Stage SAE?) 
Week 1 Week 2 Week 3 Week 4 Week 5 Week 6 Average 
Stage | 16.58 15.30 12.51 26.89 17.36 21.67 18.39 
Stage 2 13.53 13.01 10.69 19.92 15.83 14.48 14.58 
Stage 3 11.21 10.82 7.99 13.94 14.91 11.90 11.80 
Stage 4 13.53 10.15 7.10 11.63 13.43 10.76 11.11 
Stage 5 11.87 10.31 8.55 11.21 13.30 10.78 11.00 
Stage 6 11.79 10.34 8.40 11.68 13.25 11.20 11.11 
Table 2. Selected configuration of LSSVM-BFOA for Hour-Ahead Forecast 
Test data 
Week 1 Week2 Week3 Week4 Week5 Week 6 
S 20 20 20 20 16 20 
Ne 50 100 50 50 50 50 
Ng 5 
Nee 4 
Nea 2 
Ped 0.25 
Gamma S21 15.72 16.22 18.53 0.85 1.54 
Sigma 0.99 2.24 7.62 10.30 1.07 2.98 
Selected Features 17 22 22 16 22 22 
Regression (R) 0.81 0.89 0.88 0.93 0.80 0.85 
MAE 5.54 5.01 4.48 5.00 9.37 7.18 
Table 3. MAPE for Hour-Ahead Forecast in the Ontario Electricity Market 
Test week Average 
Ref. Year Method 1 5 3 4 5 6 MAPE 
LSSVM-BFOA 11.87 10.31 8.55 11.21 13.30 10.78 11.00 
[30] 2016 LSSVM-GA els) 7.45 555 7.88 724 8.77 7.40 
[4] 2013. RNN + excitable dynamics 10.76 9.12 11.61 10.45 
RNN - Expectation Maximization 
algorithm (RNN-EM) 15.09 15.16 10.52 10.21 15.78 15.71 13.72 
RNN - Extended Kalman Filter 
[6] 2011 (RNN-EKF) 16.01 16.54 11.89 11.96 16.59 16.45 14.91 
MLP-EKF 16.83 16.74 12.64 15:25 16.77 16.96 15.87 
MLP-EM 15.48 15.39 11.87 12.07 16.78 16.73 14.72 
1 006 MARS (case 1) 13.3 12.9 9.4 14.4 12.9 15.5 13.07 
MARS (case 2) 12.5 12:3 8.6 11.7 11.8 13.9 11.80 
IESO 23.78 25.26 10.41 16.22 22.06 23.51 20.21 
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5. CONCLUSION 

In the field of electricity price forecasting, accuracy of the prediction is the main issue. 
Nevertheless, predictive models with high precision usually need a more complex model structure. Apart 
from that, selection of features and parameter are also important tasks during the forecast model 
development. Hence, a hybrid model of LSSVM-BFOA for hour-ahead electricity price forecast was 
developed in this study. BFOA completes both feature and LSSVM parameter optimizations simultaneously. 
Through several levels of optimization, the amount of inputs to be fed into the LSSVM structure will be 
reduced and at the same time the value of the LSSVM parameter is refined. Although the accuracy of the 
LSSVM-BFOA is slightly lower than the previous best models, the developed model shows simpler structure 
and provides better MAPE than most of the existing models in the Ontario power market. In addition, until 
recently, no study has investigated the application of BFOA in electricity price forecasting. This contribution 
can help market members to bid effectively, maintain efficient daily operations, and ultimately increase the 
company's profits. Therefore, some refinement and modification on the LSSVM-BFOA model could be 
performed in future to reduce the forecast error. The forecast accuracy may be enhanced by proper selection 
of their parameters such as the number of bacteria (S), number of chemotactic steps (N.), number of steps 
taken during swimming (Ns), number of reproduction steps (Nye), number of elimination-dispersal steps (Nea), 
probability of elimination-dispersal (pea), attractant depth (dattract), and attractant width (Wattract). 
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